Chat mining: Predicting user and message attributes in computer-mediated communication

Authors

  • Tayfun Kucukyilmaz
  • Berkant Barla Cambazoglu
  • Cevdet Aykanat
  • Fazli Can
Abstract

The focus of this paper is to investigate the possibility of predicting several user and message attributes in text-based, real-time, online messaging services. For this purpose, a large collection of chat messages is examined. The applicability of various supervised classification techniques for extracting information from the chat messages is evaluated. Two competing models are used for defining the chat mining problem. A term-based approach is used to investigate the user and message attributes in the context of vocabulary use while a style-based approach is used to examine the chat messages according to the variations in the authors’ writing styles. Among 100 authors, the identity of an author is correctly predicted with 99.7% accuracy. Moreover, the reverse problem is exploited, and the effect of author attributes on computer-mediated communications is discussed. © 2008 Elsevier Ltd. All rights reserved.
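
The contrast between the term-based and style-based views described in the abstract can be illustrated with a minimal sketch. This is not the authors' feature set or pipeline: the function names `term_features` and `style_features`, and the particular stylistic measurements chosen, are assumptions made purely for illustration. The first function represents a message by the words it uses; the second represents it by how it is written, independent of vocabulary.

```python
from collections import Counter
import string


def term_features(message: str) -> Counter:
    """Term-based view (illustrative): bag-of-words counts over lowercased tokens."""
    # Split on whitespace, then trim punctuation from both ends of each token.
    tokens = [t.strip(string.punctuation) for t in message.lower().split()]
    return Counter(t for t in tokens if t)


def style_features(message: str) -> dict:
    """Style-based view (illustrative): a few hypothetical stylistic measurements."""
    words = message.split()
    n_chars = len(message)
    return {
        # Mean length of whitespace-separated words, punctuation included.
        "avg_word_length": sum(len(w) for w in words) / len(words) if words else 0.0,
        # Share of characters that are ASCII punctuation.
        "punctuation_ratio": (
            sum(c in string.punctuation for c in message) / n_chars if n_chars else 0.0
        ),
        # Share of characters that are uppercase letters.
        "uppercase_ratio": (
            sum(c.isupper() for c in message) / n_chars if n_chars else 0.0
        ),
        "word_count": len(words),
    }
```

Either representation could then be passed to a supervised classifier; which classifiers and features the paper actually uses is not stated in this abstract.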

Similar resources

IMPACT OF SYNCHRONOUS COMPUTER-MEDIATED COMMUNICATION ON EFL LEARNERS’ COLLABORATION: A QUANTITATIVE ANALYSIS

For the last two decades, computers have entered people’s lives in an unprecedented manner in a way that almost everybody considers life without them rather impossible. In recent years, researchers and educators have been trying to discover how computers and the Internet technology can maximize the quality of language instruction. As such, the present experimental study sought to investigate th...

A Linguistic Analysis of the Online Debate on Vaccines and Use of Fora as Information Stations and Confirmation Niche

This study looks at the communication between users concerning health risks, with the aim of exploring their use of fora and assessing whether participants establish a niche with like-minded users during these exchanges. By integrating a corpus linguistic approach with content analysis and multiple studies on computer mediated health discourse, this study analyses the intense attention paid to ...

Computer Mediated Communication for the Enhancement of Psychotherapy

Psychotherapy and psychosocial interventions rely mainly on verbal communication and language. With the expanding use of computers as well as the capacity of the Internet to bridge geographic distances and to increase access, computer mediated communication (CMC) plays an increasing role for the delivery of psychosocial interventions. This chapter focuses on the advantages and limitations of CM...

Bringing Round-Robin Signature to Computer-Mediated Communication

In computer-mediated group communication, anonymity enables participants to post controversial comments without risking accusations of improper behavior. While this may encourage more open and frank discussion, it diminishes accountability. In addition, anonymous comments are perceived as weaker than non-anonymous comments. We propose a communication protocol that allows a user to send a strong...

Information Overload in Group Communication: From Conversation to Cacophony in the Twitch Chat

Online communication channels, especially social web platforms, are rapidly replacing traditional ones. Online platforms allow users to overcome physical barriers, enabling worldwide participation. However, the power of online communication bears an important negative consequence — we are exposed to too much information to process. Too many participants, for example, can turn online public spac...

Journal:
  • Inf. Process. Manage.

Volume 44  Issue 

Pages  -

Publication date: 2008